Q-TRANS: Query Translation Into English
نویسنده
چکیده
Q-TRANS, which stands for Query-TRANSlation System, translates formal database queries into English to enhance the usability of both natural and formal language database access systems. Q-TRANS is designed for the database query language SQL, whose query expressions serve as an abstract representation from which an English paraphrase is generated. Q-TRANS is also intended to be part of the Transformational Question Answering System (TQA system), which provides a natural language interface for database query, analyzing and ultimately translating the English queries into SQL expressions. The concepts and methods used in Q-TRANS to arrive at a query translation are, however, independent from the TQA system except for compatibility of lexical and grammatical coverage of the paraphrar.es produced. The paraphrases generated are true translations of the SQL expressions which are the input to Q-TRANS and serve in a sense as deep structures that get mapped into English imperatives. The grammatical English structures Q-TRANS produces obey somewhat conflicting constraints in that they preserve as much of the SQL structure as necessary to reflect the internal logic to the user, and at the same time represent as natural English sentences as possible. I. PURPOSE A N D GOALS OF Q-TRANS Formal database query languages represent a welldefined, syntactically constrained, yet powerful means of accessing databases. Once users have mastered the syntax and lexicon of such languages, however, they still may not be successful in retrieving the desired data due to the amount of detail that needs to be specified, due to the requirement for exact matching of variables, or due to misconceptions about the contents and structure of the database. Should users be fortunate enough to have a natural language interface for database accessing, different problems arise, caused by the inherently vague and ambiguous nature of natural language. That is, the system's interpretation of English queries in terms of database queries can yield ambiguous or otherwise inappropriate results. In any case, whether a formal database language or English is the basis for communication with the database, users might benefit from an independent means of verifying whether the data retrieved is really what they wanted or asked for. Q-TRANS has been designed to help users resolve such uncertainties by providing an English paraphrase of the formal query, regardless of whether it was produced by the user directly or indirectly via a natural language front end system. Q-TRANS is currently being developed at IBM Research in conjunction with the Transformational Question Answering System (TQA) (Petrick 1981, Plath 1976). TQA provides a natural language interface to databases, allowing users to pose their queries in English rather than in some formal language. The English input is processed in the TQA system by computing its underlying structure using a lexicon, a transformational grammar, and a transformational parser. The underlying structure is translated into logical form, a relational calculus expression, which is then mapped into SQL. SQL is a relational database language used independently of TQA for accessing, maintaining and updating databases (Astrahan et al. 1976). The output of TQA, the natural language front end, is a formal SQL expression. The latter serves as input to the back end. System R. which executes the SQL query against the database. The SQL expression also constitutes the only direct link between TQA and Q-TRANS. Q-TRANS translates SQL expressions into English requests without utilizing the original English query or any intermediate analysis produced by TQA. The grammatical and lexical coverage of the paraphrase, however, is designed to be compatible with that of TQA. Given the Q-TRANS paraphrase, users now can check whether their original query was interpreted correctly by the system or phrased correctly by them, or in the rare event that TQA produces more than one formal query from the English input they can choose the appropriate one on the basis of the QTRANS paraphrases. The fact that the English paraphrase is directly generated from the formal database query distinguishes this work from most other research on database response generation, which tends to rely on the natural language front end and its analysis and semantic representation of the input query (McKeown 1979, Grishman 1979, Kaplan 1977) or on template matching (Codd 1978, Waltz 1978). Whether the INTELLECT system, which is commercially available, uses either of these methods for its paraphrase is not clear. The paraphrase appears to be produced by simply inserting English-like names for the column names of the database into a sentence skeleton. 11. Q-TRANS PROCESSING Three major steps are involved in producing the English paraphrase from the SQL expression. First, the incoming SQL expression is parsed by Q-TRANS, producing for the input string a hierarchical description which serves as the underlying structure in the translation process. This is accomplished by using a modified version of the SQL grammar which is provided to every user in the SQL manual. The original grammar is unambiguously context-free, and processing with it is extremely fast. The modifications that have been implemented consist of changes in category names and some regrouping of the original structures in the grammar so as to make them more compatible with the English structures to be generated. The end result after the modification is still unambiguously context-
منابع مشابه
The Exploitation of Translation in Talking Customers into Purchasing Products: A Critical Investigation of English-Persian Advertising Brochures for Household Appliances
The present study sets out to conduct a critical investigation into what linguistic strategies are exploited during the translation of English advertising brochures for household appliances into Persian to manipu- late customers to purchase the respective products. In the pursuit of this goal, it seeks to explore the val- ues that have been added to the Persian version of the translated b...
متن کاملSystem and Methods for Converting Speech to SQL
This paper concerns with the conversion of a Spoken English Language Query into SQL for retrieving data from RDBMS. A User submits a query as speech signal through the user interface and gets the result of the query in the text format. We have developed the acoustic and language models using which a speech utterance can be converted into English text query and thus natural language processing t...
متن کاملCaption vs. Query Translation for Cross-Language Image Retrieval
For many cross-language retrieval tasks, the predominant approach is to translate the query into the language of the document collection (target language). This often gives results as good as, if not better, than translating the document collection into the query language (source language). In this paper, we evaluate query versus document translation for the ImageCLEF 2004 bilingual ad hoc retr...
متن کاملEvaluating Language Resources for CLEF 2007
This is a report on our evaluations of using some language resources for the Indonesian-English bilingual task of the 2007 Cross-Language Evaluation Forum (CLEF). We chose to translate an Indonesian query set into English using machine translation technique, transitive translation technique, and parallel corpus technique. We also made an attempt to improve the retrieval effectiveness using a qu...
متن کاملDrawing Inspiration from the Quran to Open the M-game-Enhanced Avenue for Translation of Quranic Chapters
Game-based practicing of materials can be seen as a method of capturing an essence of real- life expe-rience which is commonly missing in the conventional face-to-face classrooms. To serve the L2 learn-ers'' immediate communicative needs in wider classroom and societal contexts, this study sought to place L2 English learners within an interactional social framework through reinforcing their Eng...
متن کامل